Multiply-Imputed Synthetic Data: Advice to the Imputer
نویسندگان
چکیده
منابع مشابه
Bayesian Estimation of Disclosure Risks for Multiply Imputed, Synthetic Data
Many national statistical agencies, survey and research organizations, and businesses— henceforth all called agencies—collect data that they intend to share with others. These agencies strive to release data that (i) protect the confidentiality of data subjects’ identities and sensitive attributes, (ii) are informative for a wide range of analyses, and (iii) are relatively straightforward for s...
متن کاملOrder selection tests with multiply imputed data
Nonparametric tests for the null hypothesis that a function has a prescribed form are developed and applied to data sets with missing observations. Omnibus nonparametric tests such as the order selection tests, do not need to specify a particular alternative parametric form, and have power against a large range of alternatives. More specifically, likelihood-based order selection tests are defin...
متن کاملObtaining Predictions from Models Fit to Multiply Imputed Data
Obtaining predictions from regression models fit to multiply imputed data can be challenging because treatments of multiple imputation seldom give clear guidance on how predictions can be calculated, and because available software often does not have built-in routines for performing the necessary calculations. This research note reviews how predictions can be obtained using Rubin’s rules, that ...
متن کاملAnalysis of Variance from Multiply Imputed Data Sets
The analysis of variance is a popular method used in many scientific applications. There are standard software for handling unbalanced data due to missing values in the outcome/dependent variable. The analysis becomes difficult when the missing values are in predictors. Multiple imputation is an increasingly popular method for handling such incomplete data. This approach involves replacing the ...
متن کاملDifferential Network Analysis with Multiply Imputed Lipidomic Data
The importance of lipids for cell function and health has been widely recognized, e.g., a disorder in the lipid composition of cells has been related to atherosclerosis caused cardiovascular disease (CVD). Lipidomics analyses are characterized by large yet not a huge number of mutually correlated variables measured and their associations to outcomes are potentially of a complex nature. Differen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Official Statistics
سال: 2017
ISSN: 2001-7367
DOI: 10.1515/jos-2017-0047